One Model To Learn Them All

Authors

  • Lukasz Kaiser
  • Aidan N. Gomez
  • Noam Shazeer
  • Ashish Vaswani
  • Niki Parmar
  • Llion Jones
  • Jakob Uszkoreit

Abstract

Deep learning yields great results across many fields, from speech recognition, image classification, to translation. But for each problem, getting a deep model to work well involves research into the architecture and a long period of tuning. We present a single model that yields good results on a number of problems spanning multiple domains. In particular, this single model is trained concurrently on ImageNet, multiple translation tasks, image captioning (COCO dataset), a speech recognition corpus, and an English parsing task. Our model architecture incorporates building blocks from multiple domains. It contains convolutional layers, an attention mechanism, and sparsely-gated layers. Each of these computational blocks is crucial for a subset of the tasks we train on. Interestingly, even if a block is not crucial for a task, we observe that adding it never hurts performance and in most cases improves it on all tasks. We also show that tasks with less data benefit largely from joint training with other tasks, while performance on large tasks degrades only slightly if at all.
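To make the "sparsely-gated layers" mentioned above more concrete, here is a minimal sketch of one common form of sparse gating: a mixture of experts in which only the top-k experts are evaluated for a given input. The abstract does not specify the gating scheme, so the top-k softmax gate, the function names `top_k_gate` and `sparse_moe`, and the toy scaling "experts" are all assumptions made for illustration, not the authors' implementation.

```python
import math

def top_k_gate(logits, k):
    """Return {expert_index: weight} for the k largest logits.

    The softmax is taken over the selected logits only, so the
    returned weights sum to 1 and all other experts are skipped.
    """
    top = sorted(range(len(logits)), key=lambda i: logits[i], reverse=True)[:k]
    m = max(logits[i] for i in top)  # subtract the max for numerical stability
    exps = {i: math.exp(logits[i] - m) for i in top}
    total = sum(exps.values())
    return {i: e / total for i, e in exps.items()}

def sparse_moe(x, gate_weights, experts, k=2):
    """Hypothetical sparsely-gated layer: mix the outputs of the top-k experts."""
    # One gate logit per expert, computed as a linear function of the input.
    logits = [sum(w * xi for w, xi in zip(row, x)) for row in gate_weights]
    gates = top_k_gate(logits, k)
    out = [0.0] * len(x)
    for i, g in gates.items():
        y = experts[i](x)  # only the selected experts are ever evaluated
        out = [o + g * yi for o, yi in zip(out, y)]
    return out, gates

# Toy experts (an assumption): each one just scales its input by a constant.
experts = [lambda x, s=s: [s * xi for xi in x] for s in (1.0, 2.0, 3.0, 4.0)]
gate_weights = [[0.0, 0.0], [1.0, 0.0], [2.0, 0.0], [3.0, 0.0]]

out, gates = sparse_moe([1.0, 0.0], gate_weights, experts, k=2)
print(sorted(gates))  # prints [2, 3]: only two of the four experts ran
```

The point of the sketch is that compute scales with k rather than with the total number of experts, which is what lets such a layer add capacity without a proportional cost per example.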

Similar resources

On the Development of a Model of Political and Socio-Economic Factors Impacting Iranian EFL Learners’ Motivation to Learn English

The present study was an attempt to identify a model of the political and socio-economic factors influencing Iranian EFL learners’ motivation to learn English. To achieve this, 20 EFL learners were interviewed about their motivation for learning English, and based on these interviews a questionnaire was designed and piloted among 221 EFL learners. Exploratory factor analysis w...

Designing an Optimal Pattern of General Medical Course Curriculum: an Effective Step in Enhancing How to Learn

Introduction: In today's world with a vast amount of information and knowledge, medical students should learn how to become effective physicians. Therefore, the competencies required for lifelong learning in the curriculum must be considered. The purpose of this study was to present a desirable general medical curriculum with emphasis on lifelong learning. Methods: The present study was Mixe...

Evaluation Psychometric Characteristics of the Persian Version of the Colorado Learning Attitudes about Science Survey Using polytomous Item Response Model

Goal: Researchers in the field of science education believe that people's attitudes about learning will have a significant impact on their future learning and what they learn from science will not be unrelated to their views and attitudes. Accordingly, most questionnaires have been developed to measure attitudes toward science, especially about physics learning attitudes...

Co-integration Relation for Oil Production in Alternative Hypotheses about OPEC Behavior

This study estimates three hypotheses of OPEC behavior (market-sharing, target revenue, and the competitive model) for the period 1980 to 2000 for all OPEC countries except Iraq. To examine the co-integration relation for oil production, we use the ADF test in OLS estimation. We also use the ARDL approach to examine these hypotheses and their long-run relationship. Results indicate none of three hypotheses...

Slow Learner Students: Dynamic Assessment, Characteristics, Identification, Teaching Methods, and Improving Learning Capacity

A slow learner (SL) student is one who has the ability to learn necessary academic skills, but at a rate and depth below the average of same-age peers. A major problem that teachers face is the difficulty of interacting with SL students. It is challenging for teachers to handle SL students and help them learn academic subjects. Handling them in ...

Toward the Development of a Model of Political and Socio-Economic Factors Impacting the Motivation of Iranian EFL learners to Learn English

The present study was an attempt to identify a model of the political and socio-economic factors influencing Iranian EFL learners’ motivation to learn English. To achieve this goal, 20 EFL learners studying at the Iran Language Institute in Darab, Iran were invited for an interview session and a questionnaire was then designed based on the interview findings. Piloted testing of the survey was c...

Journal:
  • CoRR

Volume abs/1706.05137

Publication date 2017